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ABSTRACT 


The  investigation  characterised  the  scientific  and  technical  information  needs 
of  1,  500  scientists  and  engineers  from  73  companies,  8  research  institutes, 
and  Z  universities;  and  the  flow  of  scientific  and  technical  information  (flow 
process)  inherent  in  satisfying  these  needs.  Interviewers  asked  63  questions 
in  the  following  four  subject  areas:  (1)  the  USER  of  scientific  and  technical 
information,  (2)  the  user's  most  recent  scientific  or  technical  TASK,  (3)  the 
user's  general  UTILIZATION  of  the  information  system,  and  (4)  the  user's 
SEARCH  AND  ACQUISITION  process  for  information  used  in  task  performance. 

Many  studies  have  been  performed,  and  much  has  been  written,  concerning 
the  flow  process.  The  tendency  has  been  to  examine  only  small  portions  of 
the  process,  or  to  speculate  about  large  portions  of  the  process  in  generali¬ 
ties.  Therefore,  very  little  of  a  comprehensive,  definitive,  and  unifying 
nature  actually  has  been  said  about  the  process.  This  investigation  is  the 
fi'-st  attempt  to  obtain  so  much  data  on  so  large  a  portion  of  the  process,  and 
its  analysts  is  the  first  attempt  to  draw  definitive  and  unifying  conclusions 
from  such  dtn.z.. 


Goals  for  the  flow  process,  future  analysis  of  the  flow  process,  character¬ 
ization  of  the  flow  process,  and  analysis  of  flow-process  data  are  summarized. 
In  addition,  the  goals  and  future  analysis  recommendations  reflect  work  per¬ 
formed  by  the  author  after  completion  of  the  study. 
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INTRODUCTION 


A  major  investigation  to  determine  how  scientists  and  engineers  acquire 
information  ha*  been  recently  completed.  The  objective  of  the  investigation 
was  to  characterise  scientific  and  technical  information  needs,  and  the  flow 
of  scientific  and  technical  information  (flow  process!  renoired  to  satisfy 
these  needs.  The  study's  conclusions  arc  as  important  to  individual  organi¬ 
zations  as  they  are  to  the  government,  and  as  important  to  scientific  and 
technical  management  as  they  are  to  those  directly  concerned  with  the  flow 
process, 

Data  were  obtained  by  personal  interviews,  with  a  representative  sample  of 
1,  500  from  a  population  of  approximately  120.  000  scientists,  engineers,  and 
technical  personnel.  These  personnel  were  employed  by  73  companies, 

8  research  institutes,  and  2  universities.  The  Appendix  lists  participating 
organizations,  with  the  number  of  personnel  interviewed  from  each. 

To  ensure  high-quality  data,  the  interviewers  were  thoroughly  trained,  and 
the  interviews  were  carefully  recorded  and  checked  for  accuracy  and 
consistency.  The  interviewers  asked  63  questions  in  the  following  four 
subject  areas:  (1)  the  USER  of  scientific  and  technical  information,  (2)  the 
user's  moat  recent  scientific  or  technical  TASK,  (3)  the  user's  general 
UTILIZATION  of  the  information  system,  and  (4)  the  user's  SEARCH  AND 
ACQUISITION  process  tor  information  used  in  task  performance. 

Many  studies  have  been  performed,  and  much  has  been  written,  concerning 
the  flow  process.  The  tendency  has  been  to  examine  only  small  portions  of 
the  process,  or  to  speculate  about  large  portions  of  the  process  in  generali¬ 
ties.  Therefore,  very  little  of  a  comprehensive,  definitive,  and  unifying 
nature  actually  has  been  said  about  the  process.  This  investigation  is  the 
first  attempt  to  obtain  so  much  data  on  so  large  a  portion  of  the  process, 
and  its  analysis  is  the  first  attempt  to  draw  definitive  and  unifying  conclusions 
from  such  data.  During  this  analysis,  qualitative  question  responses  were 


transformed  into  numerical  form,  a  process  model  for  relationships  among 
questions  was  constructed  and  estimated,  and  numerical  relationship  results 
were  transformed  back  to  qualitative  form. 

Coals  for  the  flew  process,  future  analysis  of  the  flow  process,  character- 
ir.ation  of  the  flow  process,  and  analysis  of  flow-process  data  arc  discussed 
in  subsequent  sections.  This  discussion  suminarir.ee  the  investigation,  which 
is  completely  riescrihed  in  Reference  1.  In  addition,  the  goals  and  future 
analysis  recommendations  reflect  work  performed  by  the  author  after  the 
publication  of  Reference  1. 

.Reference  t  presents  the  application,  to  a  process  or  system  in  general,  of 
the  recommended  program  for  analysis  and  optimization,  as  well  as  the  first 
two  portions  of  the  analysis.  Computer  programs  used  in  the  analysis  are 
documented  in  Reference  3. 

The  surveyed  organisations  constitute  a  reasonable  cross-section  of  scientific 
and  technical  organisations  i.i  general,  although  they  were  selected  on  the 
basis  of  being  defense  contractors  (see  Appendix).  In  the  absence  of  a  compar¬ 
ably  comprehensive  and  definitive  investigation  of  the  flow  process  in  general, 
it  is  informative-  to  view  the  results  of  the  study  as  generally  indicative,  if 
not  actually  applicable. 

For  this  reason,  the  terminology  employed  here  has  been  selected  to  mini¬ 
mize  dependence  upon  the  defense  industry.  The  correspondence  between  the 
terminology  and  that  of  Reference  1  is  as  follows: 

«  User's  salary  level  replaced  user's  equivalent  GS  rating. 

•  Documentation  Center  replaced  Defense  Documentation  Center. 

«  Government  Information  Center  replaced  DOD  Information  Center. 


GOALS  FOR  THE  B  LOW  PROCESS 

The  conclusion*  of  the  investigation  provide  a  sot  of  goals  for  the  flow  pro¬ 
cess!  and  a  measure  with  which  to  evaluate  a  general  Information  system. 
These  goals  are  supported  by  the  characterisation  of  the  flow  process  below, 
and  the  numerical  results  which  appear  in  Volume  111  of  Reference  1. 

THE  FLOW  PROCESS 

Figure  1  is  helpful  in  visualising  the  goals  described  by  the  remainder  of  this 
section.  It  represents  either  of  the  following  processes' 

•  The  flow  process  in  task  pe  rfornianee.  when  UTILIZATION  repre¬ 
sents  the  utilization  of  the  information  system  in  task  performance. 

a  The  flow  process  in  general,  when  TASK  represents  the  user's 
scientific  or  technical  task  in  general. 


FLOW  PROCESS 


sum 
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RRIDGK  THE  INFORMATION  GAP 


An  information  pap  r*i»tii  between  I  In-  user  of  scientific  and  technical 
information,  ami  tlu-  information  system  which  servos  his  needs.  This 
information  yap  must  he  bridped  if  the  user  is  to  obtain  high-quality 
info  rmat  ion. 

RKORIKNT  THK  USKR  AND  THK  INFORMATION  SYSTEM 

Roth  the  o»c  •  an.,  me  iniormalion  nyxlcm  need  to  he  reoriented.  Scientists 
and  engineers,  cspei  iall}  those  in  management  or  those  possessing  an 
advanced  depree,  must  become  active  seekers  of  high-quality  information 
serviees.  For  its  part,  the  information  system  must  become  an  active 
provider  of  high-quality  information  services,  not  merely  a  passive  document 
repository. 

KXPAND  THK  INFORMATION  BASK 

An  information  base  forms  the  foundation  of  the  information  system.  In 
peneral,  it  contains  information  which  is  conceptual  ami  research-oriented. 
The  information  base  lias  to  be  expanded  to  include  design  and  performance 
information,  and  information  which  is  development-  and  production-oriented. 

RKSTRUCTURK  THK  INFORMATION  BASK 

The  information  base  is  composed  of  information  media  which  convey  the 
information.  Fur  the  must  part,  these  media  are  written  in  form,  format  in 
composition,  and  textual  in  layout.  It  nu.it  he  restructured  to  include  media 
which  are  oral  in  form,  informal  and  semiformal  in  composition,  and  graphi- 
cal  in  layout. 

MAKK  THE  INFORMATION  BASK  FKKXIBKK 

’The  information  base  should  he  made  flexible  to  permit: 

•  Information  to  he  indexed,  abstracted,  selectively  organized,  and 
selectively  analyzed. 
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•  Information  to  be  selectively  repackaged  in  information  media  of 
appropriate  form,  composition,  and  layout. 

•  Information  media  to  be  indexed  and  abstracted. 

MAKE  THE  INFORMATION  BASE  MOBILE 

The  information  base  needs  to  be  made  mobile,  to  that  information  aware- 
nett  it  automatic,  rapid,  and  selective;  and  information  acquisition  is  quick 
and  easy. 

EXPAND  THE  INFORMATION  SYSTEM 

Expert  personnel  mutt  be  employed  to  expand  the  information  system  by 
providing  both  information  resource!  and  connections  with  the  informal 
information  system  ("invisible  colleges").  This  expansion  will  add  an 
entirely  new  dimension  to  the  information  system. 

EXTEND  THE  INFORMATION  SYSTEM 

The  information  system  has  to  be  extended  into  the  local  work  environment 
by  the  automatic  and  selective  dissemination  of  abstracts  for  media  in  the 
information  base,  and  listings  of  disciplinary  areas  with  an  expert's  level 
of  competence  in  each  area. 


FUTURK  ANALYSIS  OF  T!IF  FLOW  PROCKSS* 


The  investigation  has  gone  rated  a  great  deal  of  valuable  data  concerning  the 
flow  process.  Analysis  of  the  data,  despite  funding  and  time  limitations 
inherent  in  an  exploratory  Jtudy,  has  yielded  considerable  insight  into  the 
flow  process.  This  analysis  also  indicates  that  certain  portions  of  the  flow 
process  merit  additional  investigation,  and  that  certain  portions  of  the 
analysis  merit  rclincmcnt. 

A  complex,  butas-yet  incompletely  characterised,  relationship  exists 
between  the  Mow  process  and  the  performance  of  scientific  and  technical 
tasks.  To  improve  task  performance  (improve  quality,  reduce  cost,  or 
shorten  time),  both  the  government  and  individual  organisation#  havr  made 
large  investments  in  improvement  of  the  flow  process  by  improvement  of  the 
information  system  (see  Figure  1),  Sufficient  improvement  in  (optimisation 
of)  the  flow  process  is  achieved,  when  sufficient  improvement  in  (optimisa¬ 
tion  o')  task  performance  is  achieved. 

Therefore,  the  additional  investigation  should  tic  performed  in  the  framework 
of  a  general  program  for  analysis  and  optimization  of  the  flow  process  with 
respect  to  task  periormance. 

The  analysis  provides  the  basis  for  this  program  of  analysis  and  optimiza¬ 
tion  in  the  following  manner; 

•  A  model  of  the  flow  process  with  which  to  plan  investigations  and 
perform  analyses. 

•  An  analytical  approach  to  transform  qualitative  question  responses 
into  numerical  form,  to  construct  and  estimate  a  process  model 
for  relationships  among  questions,  and  to  transform  the  numerical 
relationship  results  back  to  qualitative  form. 

The  application  of  the  first  two  portions  of  the  above-mentioned  analytical 
approach  to  a  process  or  system  in  general  is  contained  in  Reference  2. 


‘‘‘The  future  analysis  recommendations  should  be  assigned  priorities 
according  to  the  twin  criteria  of  objectives  and  available  resources. 
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ADDITIONAL  INVESTIGATION 


Investigation  of  the  following  areas  appears  promising: 

•  The  feasibility  of  the  conclusions,  and  their  effect  upon  the  flow 
process. 

•  The  effect  of  the  quality  and  timeliness  of  information  acquired  upon 
the  quality,  cost,  and  speed  of  task  performance. 

•  The  difficulties  encountered  in  the  utilization  of  the  information 
system,  with  an  emphasis  upon  separating  those  attributable  to 
inoide  the  organization  from  those  attributable  to  outside  the 
organization. 

•  The  users  who,  though  cognizant  of  certain  portions  of  the  informa¬ 
tion  system,  do  not  use  them. 

a  The  utilization  of  the  information  system  in  task  performance. 

•  Those  areas  suggested  by  refined  analysis  of  the  data. 

PROGRAM  FOR  ANALYSIS  AND  OPTIMIZATION 

The  flow  process  (Figure  1)  is  quite  complex,  and  experimentation  (investi¬ 
gation)  regarding  it  is  both  difficult  and  expensive.  For  such  a  process, 
mathematical  solution  for  outputs  in  terms  of  inputs  is  usually  noi  f,  asible, 
anti  computer  simulation  is  often  an  effective  and  efficient  complement  to 
expe  rime  nlation. 

When  a  model  (mathematical  representation)  for  the  process  is  translated 
into  a  simulation  computer  program  (computer  repre 3cntation)  for  the 
process,  the  process  and  the  effects  of  various  factors  upon  it  may  be 
simulated.  The  accuracy  and  precision  of  the  computer  simulation  increase 
as  the  accuruc)  and  precision  nt  the  model  increase,  which  occurs  as 
knowledge  nonce  rning  the  process  increases. 

Four  periods  occur  in  the  evolution  of  a  body  of  knowledge  as  it  matures 
from  an  art  into  a  science;  these  are  description  modeling,  prediction,  and 
control  and  optimization.  Computer  simulation  yields  appropriate  results 
ir.  the  modeling,  prediction,  and  control  and  optimization  periods.  With  the 
completion  of  ihi?  investigation,  knowledge  concerning  tin  flow  process  is 
emerging  from  the  description  period  and  entering  into  the  modeling  period. 


A  program  that  provide*  a  meaningful  framework  for  the  coordination  of 
experimentation  and  computer  simulation  in  the  analysis  and  optimisation  of 
the  flow  process  is  illustrated  by  Figure  i,  and  is  composed  of  the  following 
10  basic  stages: 

1.  Quantitative  process  analysis  develops  a  model  by  transforming 
qualitative  elements  of  the  process  into  numerical  form,  and  by 
constructing  a  model  for  relationships  among  component  part*  of 
the  process.  The  transformation  of  elements  is  accomplished  by 
arranging  the  elements  into  an  informative  detailed  (local)  structure, 
and  then  associating  s  meaningful  number  with  each  element. 
Construction  of  the  model  is  accomplished  by  arranging  the  compo¬ 
nent  parts  into  an  informative  general  (global)  structure,  and  then 
specifying  the  general  form  for  meaningful  relationships  among 
component  parts. 

2.  Experimental  trial  produces  experimental  data. 

d.  Model  estimation  produces,  from  experimental  and  available 

auxiliary  data,  estimates  of  unspecified  constants  in  the  general 
form  of  relationships  in  the  model:  and  a  preliminary,  but  insuffic¬ 
ient,  evaluation  and  validation  (positive  check)  of  the  process 
representation  by  the  model. 


PROGRAM  FOR  ANALYSIS  AND  OPTIMIZATION 
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4.  Simulation  programming  tranalatea  the  model  into  a  simulation 
computer  program. 

5.  Simulation  trial  produces  simulation  data. 

6.  Model  and  simulation  data  comparison  provides  an  evaluation  and 
validation  of  the  model's  representation  by  the  simulation  computer 
program. 

7.  Experimental  and  simulation  data  comparison  provides  an  evaluation 
and  validation  of  the  process'  representation  by  the  combination  of 
model  and  simulation  computer  program.  Thus,  the  process' 
representation  by  the  model  is  indirectly  evaluated  and  validated, 
given  that  the  model's  representation  by  the  simulation  computer 
program  htig  been  validated. 

8.  Experimental  and  simulation  data  analysis  characterizes  and  evalu- 
— ates  the  process,  in  terms  of  criteria  and  constraints,  and  suggests 

modification  of  the  process  for  optimization  (achievement  of 
sufficient  improvement). 

9.  Optimization  modifies  the  process  and  applies  appropriate  stages  of 
the  program  to  the  modified  process,  in  an  iterative  manner  until 
sufficient  improvement  is  achieved. 

10,  Design  of  experimental  and  simulation  trials  aids  optimization. 


For  the  design  of  a  new  flow  procese-  the  approach  of  this  program  may  be 
modified  to  yield  a  program  for  design  and  optimization  (Figure  3).  The 
application  to  a  process  or  system  in  general,  of  botn  the  program  for 
analysis  and  optimization  and  the  program  for  design  and  optimization,  is 
presented  in  Reference  2. 

REFINED  ANALYSIS  OF  THE  DATA 

Because  only  a  small  fraction  of  the  effort  expended  in  collecting  data  is 
typically  devoted  to  their  analysis,  a  l£“ge  amount  of  the  information  they 
contain  generally  is  undiscovered  and  unexploited.  A  more  profound  under¬ 
standing  of  the  flow  process  may  be  achieved  through  more  refined  analysis 
of  the  data,  as  follows: 

•  Investigation  of  the  effect  of  organization  size,  industry,  and 
interviewer  bias  upon  answers  to  questions. 

•  Improvement  in  the  arrangement  of  responses  to  a  question,  and  the 
association  of  a  numerical  "alue  with  each  response,  with  the  objec¬ 
tive  of  improving  the  linearity  of  relationships  among  questions. 


PROGRAM  FOR  DESIGN  AND  OPTIMIZATION 


•  Division  of  the  1,  500  users  into  appropriate  groups  for  analysis  and 
comparison,  such  as  the  three  groups  formed  by  those  who  acquired: 
conceptual  information,  design  and  performance  information,  and 
production  information. 

•  Incorporation  into  the  analysis  of  differences  between  the  corres¬ 
ponding  characteristics  of  the  desired  and  actually  received 
information,  and  additional  special  indexes. 

•  Reformulation  and  re-eetimation  of  appropriate  linear  relationships 
among  questions,  to  reflect  the  above  improvements  and  to  investi¬ 
gate  more  specific  relationships  which  involve  only  single  questions 
(rather  than  combinations  of  related  questions). 

•  Formulation  and  estimation  of  additional  linear  relationships  within 
the  flow  process,  such  as  those  which  reverse  the  input/output 
relations  of  the  flow  process  (under  the  characterization  below)-- 
for  the  investigation  of  the  selective  dissemination  of  information. 


CHARACTERIZATION  OF  THE  FLOW  PROCESS 


The  findings  of  the  investigation,  which  characterise  the  flow  process,  are 
highlighted  in  this  section.  They  are  illustrated  by  the  accompanying  figures, 
and  supported  by  the  numerical  results  in  Volume  HI  of  Reference  1, 

TYPES  OF  INFORMATION 

Almost  one -half  of  the  information  was  in  engineering  fields,  and  almost  two- 
fifths  of  it  was  in  scientific  fields  (see  Figure  4).  In  the  conceptual-design 
and  performance-production  c>clc,  over  60%  of  the  information  involved 
design  and  performance  (Figure  5). 

MEDIA  FOR  CONVEYING  INFORMATION 

Oral  information  was  wanted  more  than  one  out  of  three  times,  and  aemi- 
fonnally  written  information  also  was  wanted  more  than  one  out  of  three 

FIELD  OF  INFORMATION 


FIGURE  4 
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A.  THOSE  RESPONSES  WITH  OVER  3  PERCENT  ARE:  "ORAL  CONTACTS-ALL  OTHER"  <11*1  AND 
"ORAL  CONTACTS  WITH  MANUFACTURER"  0.5*1. 

8.  THOSE  RESPONSES  WITH  OVER)  PERCENT  ARE:  "PERSONAL  NOTES.  PERSONAL  LOGS  AND 
PERSONAL  FILES"  0*h  "CORRESPONDENCE.  MEMOS  ANO  TWX"  (6*V  AND  "DRAWINGS 
AND  SCHEMATICS"  <5<>. 

C.  THOSE  RESPONSES  WITH  OVER  5  PERCENT  ARE:  "SYSTEM  SPECIFICATION  DOCUMENTS  " 

(4. 5*1  AND  "MANUALS"  0.5*). 

D.  THOSE  RESPONSES  WITH  OVER)  PERCENT  ARE:  "JOURNALS"  M.  5*1  AND  TEXTBOOKS"  ().)*•. 
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FIGURE  9 
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FIGURE  10 


the  information,  the  information  need*  were  satisfied  within  the  allowable 
acquisition  time  (ace  Figure  11). 

UTILIZATION  OF  INFORMATION 

Over  two-fifths  of  the  information  waa  used  throughout  the  entire  taak,  and 
over  one-third  ot  it  waa  uacd  in  major  portions  of  the  taak  (Figure  12). 
Almost  80%  of  the  information  was  absolutely  essential  to  the  task,  and  over 
15%  of  it  was  extremely  helpful  in  the  task  (see  Figure  13). 


UTILIZATION  OF  THE  INFORMATION  SYSTEM 

Of  the  users,  95To  utilised  their  organisation's  technical  information  center 
(library),  and  over  50%  utilised  it  twice  a  month  or  more  (Figure  *4). 

Title  listings  or  abstracts  of  information  media  would  have  been  useful  for 
finding  more  than  two-fifths  of  the  needed  information.  However,  the 
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Technical  Abstract  Bulletin  (TAB)  was  utilised  by  less  than  two  out  of  five 
users;  and  it  was  unknown  to  over  two  out  of  five  users  (Figure  1  5).  Less 
than  20%  of  the  users  utilised  the  Scientific  and  Technical  Aerospace  Reports 
(STAR),  while  over  60%  of  them  did  not  know  of  it  (Figure  16). 

Over  two  out  of  five  users  encountered  difficulties  in  the  utilisation  of  the 
information  system.  Lack  of  timely  awareness  of  information  accounted  for 
almost  two-fifths  of  these  difficulties,  and  lack  of  timely  acquisition  of 
information  accounted  for  over  one-half  of  them  (sec  Figure  17). 


SCIENTIFIC  OR  TECHNICAL  TASKS 

More  than  50%  of  the  tasks  were  in  engineering  fields-  and  more  than  30%  of 
them  were  in  scientific  fields  (see  Figure  18).  In  the  research-development- 
production  cycle,  almost  two-thirds  of  the  tasks  were  development 
(Figure  19).  Two  out  of  three  tasks  involved  design  and  performance,  within 
the  conceptual-design  and  performance-production  cycle  (see  Figure  20). 
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CONCEPTUAL  DESIGN  AND  PERFORMANCE  •  PRODUCTION 
CYCLE  LOCATION  OF  TASK 


USERS  OF  SCIENTIFIC  AND  TECHNICAL  INFORMATION 

Over  one-half  of  the  users  held  engineering  positions,  and  almost  one-third 
oi  them  held  scientific  positions  (Figure  21).  In  the  research-development* 
production  cycle,  two  out  of  three  users  occupied  development  positions 
(Figure  22).  Of  the  users,  40%  were  not  managers,  and  over  30%  managed 
from  one  to  five  persons  (see  Figure  23).  More  than  one-half  of  the  users 
possessed  a  bachelor's  degree,  and  almost  one-third  of  them  possessed  an 
advanced  degree  (see  Figure  24). 


In  general,  these  significant  users  of  scientific  and  technical  information 
were  the  real  users  of  the  information  system- -and  also  the  ones  most 
frustrated  by  difficultie s  involving  its  use. 
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FLOW  PROCESS  FROM  AN  INPUT /OUTPUT  POINT  OF  VIEW 


For  design  and  analytic  of  the  flaw  process,  it  is  meaningful  to  consider  the 
flow  proceaa  from  an  input/output  point  of  view.  Input  represents  "tendency 
to  influence,  "  output  represents  "tendency  to  be  influenced,  "  and  an  arrow 
represents  "the  tendency  of  influence  from  input  to  output.  " 

The  components  of  the  flow  process  are  USER,  TASK,  UTILIZATION,  and 
SEARCH  AND  ACQUISITION.  For  the  flow  process  in  general.  USER  and 
TASK  act  as  input  components;  and  UTILIZATION  and  SEARCH  AND 
ACQUISITION  act  as  output  components  (Arrow  1  in  Figure  25).  The  other 
input/output  relations  among  components  of  the  flow  process  have  the 
following: 

•  USER  as  input  component,  and  TASK  as  output  component  (Arrow  2 
in  Figure  25). 

•  USER  as  input  component,  and  UTILIZATION  as  output  component 
(Arrow  3  in  Figure  25). 


•THE  ARROWS  POINT  FROM  INPUT  (TENDING  TO  INFLUENCE!  TO  OUTPUT  (TENDING  TO 
BE  INFLUENCED). 

FIGURE  25 


•  USER,  TASK,  and  UTILIZATION  as  input  componenta,  and  SEARCH 
AND  ACQUISITION  aa  output  component  (Arrows  marked  4  in 
Figure  25). 

Within  each  component,  there  are  input  factors  and  output  factora.  Factor 
representa  "combination  of  related  questions.  11  Figurea  26  through  JO  pre¬ 
sent  input  and  output  factora  for  USER,  TASK,  UTILIZATION,  SEARCH  AND 
ACQUISITION,  and  the  flow  proceaa,  reapectively.  In  theae  figurea,  input 
factors  are  ranked  in  order  of  their  overall  contribution  to  the  relationships 
within  the  stated  component(s). 

One  must  realize,  however,  that  the  statistical  techniques  of  the  analysis 
can  merely  characterize  a  relation.  They  cannot  imply  that  a  relation  is 
cause-and-effect,  for  this  can  only  be  determined  by  a  thorough  understanding 
of  the  flow  process. 
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ANALYSIS  OF  FLOW-PROCESS  DATA 


OVERVIEW  OF  THE  ANALYSIS 

The  data  consist  of  1,  500  transcribed  interviews,  each  containing  answers  to 
55  questions  having  allowable  responses  which  arc  qualitative,  and  8  questions 
having  allowable  responses  which  are  quantitative.  Figures  4  through  24 
summarise  the  responses  to  21  significant  questions,  and  Figures  26  through 
30  summarise  the  subject  of  all  but  eight  less-important  questions.  For  a 
complete  listing  of  questions  and  their  responses,  Appendixes  5  and  6  to 
Volume  II  of  Reference  1  should  he  consulted. 

Detailed  information  describing  small  portions  of  the  flow  process  is  provided 
by  one-way  and  two-way  frequency  distributions  (for  example,  Figures  4 
through  24).  A  one-way  frequency  distribution  is  the  distribution  of  the 
percentage  of  answers  to  a  question  that  corresponds  to  each  (allowable) 
question  response,  and  a  two-way  frequency  distribution  it  the  distribution 
of  the  percentage  of  answers  to  a  pair  of  questions  that  corresponds  to  each 
pair  of  (allowable)  question  responses  (see  Table  1). 

In  addition,  the  relationship  analysis  cycle  yields  general  information 
describing  both  small  and  large  portions  of  the  flow  process  (for  example, 
Figures  26  through  30).  In  this  cycle,  qualitative  question  responses  are 
transformed  into  numerical  form,  a  process  model  for  linear  rclationr  .ps 
among  questions  is  constructed  and  estimated,  and  numerical  relationship 
results  are  transformed  back  to  qualitative  form  (see  Figure  31). 

Transformation  of  qualitative  question  responses  into  numerical  form  is 
accomplished  by  arranging  the  responses  into  an  informative  detailed  (local) 
structure,  and  then  associating  a  meaningful  number  with  each  response. 

The  construction  of  a  process  model  for  linear  relationships  among  questions 
is  accomplished  by  arranging  the  questions  into  an  informative  general 
(global)  structure,  and  then  specifying  the  general  form  of  meaningful  linear 
relationships  among  questions.  Next  unspecified  constants,  in  the  general 
form  of  these  relationships  in  the  process  model,  are  estimated  front  the 


Table  1 

ONE-WAY  AND  TWO-WAY  FREQUENCY  DISTRIBUTIONS 


One-Way  Frequency  Distribution 

Question  22:  Desired  Volume  of  Information  Media 

Response 

Frequency  (%) 
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data  by  employing  the  statistical  technique  called  stepwise  regression  analy¬ 
sis.  Finally,  numerical  relationship  results  are  transformed  back  to  quali¬ 
tative  form  by  ranking  questions  in  the  order  of  their  contribution  to  each 
relationsnip,  and  then  in  the  order  of  their  overall  contribution  to  the  relation¬ 
ships  in  each  component  of  the  flow  process  and  the  flow  process  itself. 


The  relationship  analysis  cycle  is  believed  to  be  novel  in  the  field  of  infor¬ 
mation  science.  Its  employment  and  testing  in  this  investigation  have  yielded 
results  that  are  encouraging,  and  implications  for  the  future  that  are 
provocative. 

REQUIREMENTS  OF  TILE  ANALYSIS 

An  analysis  ought  to  operate  upon  the  data  in  such  a  way,  and  to  such  an 
extent,  that  the  analytical  requirements  are  met.  What  an  analysis  ought  to 
accomplish  is  determined  by  both  the  data  and  the  analytical  requirements. 
The  weaker  the  data  or  the  stronger  the  analytical  equirements,  the 
stronger  should  an  analysis  be. 

An  analysis  should  provide  a  bridge  between  the  data,  and  meaningful  con¬ 
clusions  and  recommendations.  It  should  bring  the  information  content  of  the 
data  into  focus.  It  should  transform  apparent,  chaos  into  orderly  findings, 
which  readily  lead  to  conclusions  and  recommendation*. 


To  Achieve  thie,  an  analysis  must  organize,  summarize,  and  interpret  the 
data.  The  methods  of  summarization  employed  by  an  analysis  ought  to  be 
sufficient  to  bring  both  the  detailed  and  general  information  content  of  the 
data  into  focus.  Higher-order  effects  are  indicated  by  detailed  information, 
whereas  lower-order  effects  are  indicated  by  general  information. 

Detailed  information  is  relatively  close  to  the  surface  of  the  data,  and 
requires  a  relatively  small  amount  of  summarization  to  be  brought  into  focus. 
The  more  the  detail,  the  less  the  summarization  required.  On  the  other 
hand,  general  information  is  buried  relatively  far  beneath  the  surface  of  the 
data,  and  requires  a  relatively  large  amount  of  summarization  to  be  brought 
into  focus.  The  more  the  generality,  the  more  the  summarization  required. 

By  its  very  nature,  detailed  information  describing  only  small  portions  of 
the  flow  process  may  be  comprehended  at  once.  However,  general  informa¬ 
tion  describing  either  small  or  large  portions)  of  the  flow  process  may  be 
comp-cnended  at  once.  That  is,  only  small  amounts  of  great  detail  may  be 
simultaneously  digested;  whereae,  either  small  or  large  amounts  of  little 
detail  may  be  simultaneously  digested. 

Consequently,  the  analysis  first  should  summarize  the  data  until  their 
detailed  information  content,  describing  only  small  portions  of  the  flow 
process  at  once,  is  brought  into  comprehensible  focus.  It  then  should  con¬ 
tinue  to  summarize  the  data  until  their  general  information  content, 
describing  both  small  and  large  portions  of  the  flow  process  at  once,  is 
brought  into  comprehensible  focus.  C.ierwise,  any  interested  person  will 
be  forced  to  accept  only  the  data's  detailed  information  content;  or  to  himself 
perform  additional  summarization,  so  that  the  data's  general  information 
content  is  brought  into  comprehensible  focus. 

STATISTICAL  CONCEPTS 

Tc  aid  the  translation  of  these  general  analysis  requirements  into  specific 
analysis  objectives,  pertinent  statistical  concepts  are  briefly  introduced  and 
discussed  in  the  following  paragraphs. 


Frequency  Distributions 

One-way  and  two-way  frequency  distributions  have  been  defined  above. 
Higher-order  frequency  distributions  are  similarly  defined.  Frequency 
distributions  necessitate  the  simplest  ope  ration  upon  the  data,  and  contain  a 
wealth  of  detailed  information  regarding  variation  in  the  data;  however,  they 
provide  the  minimal  amount  of  summarization. 

The  usual  procedure  for  summarizing  a  one-way  frequency  distribution  is  to 
combine  some  question  responses,  and/or  to  obtain  measures  of  the  one-way 
frequency  distribution's  location  and  spread.  The  distribution's  location  may 
be  measured  by  its  mode  if  the  qualitative  question  responses  are  not 
arranged  into  an  order,  by  its  median  if  the  qualitative  question  responses 
are  ordered,  and  by  its  mean  if  the  question  responses  are  quantitative. 
Measures  of  the  distribution's  spread  are  its  range  if  the  qualitative  question 
responses  are  ordered,  and  its  standard  deviation  if  the  question  responses 
are  quantitative.  More  definitive  information  is  obtained  by  this  summariza¬ 
tion  when  the  qualitative  question  responses  are  ordered,  and  even  more 
definitive  information  is  obtained  when  the  question  responses  are 
quantitative. 

Summarization  of  two-way  frequency  distributions  is  both  more  necessary 
and  more  difficult  to  perform.  The  first  step  is  to  combine  some  responses 
for  each  question,  and/or  to  obtain  measures  of  the  location  and  dispersion 
of  each  question's  one-way  frequency  distribution.  Then  a  measure  of  the 
association  or  interaction  between  the  two  questions  is  sought.  If  the  quali¬ 
tative  responses  to  each  question  are  ordered,  the  interaction  between  the 
two  questions  may  be  measured  by  the  rank  correlation  (coefficient);  and  if 
each  question's  responses  are  quantitative,  the  interaction  may  be  measured 
by  the  correlation  (coefficient).  An  indirect  approach  to  measuring  this 
interaction  when  the  question  responses  are  qualitative  is  provided  by  Chi- 
square,  which  indicates  the  departure  of  the  questions  from  being  independent 
or  not  related. 


Computation  of  the  rank  correlation  automatically  associates  the  numbers  1, 

2,  .  .  .  ,  r.  with  the  first,  second . nth  responses  to  these  questions.  On 

the  other  hand,  the  computation  of  toe  correlation  depends  upon  the  quantita¬ 
tive  responses  to  each  question,  or  the  numbers  associated  with  the  responses 
to  each  question. 

As  for  one-way  frequency  distributions,  more  definitive  information  is 
obtained  by  this  summarization  when  the  qualitative  question  responses  are 
ordered,  and  even  more  definitive  information  is  obtained  when  the  question 
responses  are  quantitative.  Arrangement  of  question  responses  into  an 
informative  order  is  called  development  of  a  detailed  structure,  while 
association  of  a  meaningful  number  with  each  response  is  called  definition 
of  a  numerical  description  for  the  detailed  structure.  The  development  of  a 
detailed  structure,  followed  by  the  definition  of  a  numerical  description  for 
the  detailed  structure,  transforms  the  qualitative  question  responses  into 
numerical  form. 

Higher-order  frequency  distributions  become  increasingly  harder  to  generate, 
depict,  and  comprehend.  Consequently,  their  summarization  becomes  both 
increasingly  more  necessary  and  more  difficult.  They  are  of  relatively  little 
analytical  use,  except  in  rare  instances. 

Relationships 

For  questions  with  quantitative  responses,  a  relationship  among  questions  is 
a  mathematical  expression  of  the  variation  in  one  question  as  a  function  of  the 
variations  in  the  other  questions.  It  is  frequently  both  convenient  and 
sufficiently  accurate- -for  example,  during  exploratory  research  such  as  thir 
investigation- -to  represent  a  relationship  by  a  linear  one,  which  depicts  the 
variation  in  one  question  as  a  linear  combination  of  the  variations  in  the  other 
questions.  The  general  form  of  a  linear  relationship  is  written: 

Y  =  PQ  +  Xj  +  ...  +  PpXp  +  «. 

with  Y  being  one  question,  Xj.  X^ . Xp  being  the  other  questions,  PQ1 

pj.  .  .  .  ,  Pp  being  the  unspecified  constants  or  coefficients,  and  •  being  the 


residual.  The  correlation,  in  reality,  meaaurea  the  degree  of  linearity  for 
the  interaction  between  the  two  auestiom;  or  the  cloirneai  of  tbs  two  quet- 
tions  to  being  adequately  represented  by  a  linear  relationship, 

V  =  0O  +  P,  X  +  «. 

between  one  question  Y  and  the  other  question  X. 

The  analysis  of  relationships  requires  not  only  quantitative  data,  but  also 
specification  of  the  general  form  for  meaningful  linear  relationships  among 
questions.  In  turn,  the  specification  of  the  general  form  for  these  relation¬ 
ships  requires  that  the  questions  be  arranged  into  an  informative  order. 
Arrangement  of  questions  into  an  informative  order  is  called  development  of 
a  general  structure.  The  development  of  a  general  structure,  followed  by 
the  specification  of  the  general  form  for  meaningful  linear  relationships 
among  questions  in  the  general  structure,  accomplishes  the  construction  of 
a  process  model.  Consequently,  the  analysis  of  relationships  depends  upon 
both  the  transformation  of  qualitative  question  responses  into  numerical 
form,  and  the  construction  of  a  process  model  for  linear  relationships  among 
questions. 

Comparison 

Two-way  frequency  distributions  are  easy  to  generate,  and  their  concept  is 
easy  to  understand.  They  summarize  relatively  little,  however,  and  their 
information  content  is  difficult  to  comprehend  without  additional  summarisa¬ 
tion.  On  the  other  hand,  relationships  are  rot  as  easy  to  obtain  and  to 
understand  in  concept;  but  they  do  summarize  a  great  deal,  and  their  infor¬ 
mation  content  is  easy  to  comprehend  without  additional  summarization. 

Let  the  responses  to  one  question  be  associated  with  the  X-axis,  and  the 
msponses  to  another  question  be  associated  with  the  Y-axis.  Then  a  two- 
way  frequency  distribution  rnay  be  viewed  as  a  geometric  representation  for 
the  distribution  of  the  answers  to  the  two  questions,  in  which  each  percentage 


give*  the  proportion  of  answer-pairs  which  are  associated  with  the  corres¬ 
ponding  response -pair  point.  In  addition  a  linear  relationship, 

Y  =  po  +  pj  X  +  «. 

may  be  viewed  as  a  natural  summarisation  of  the  two-way  frequency 
distribution.  It  replaces  the  geometric  representation  of  the  distribution 
with  a  line  through  it,  and  with  an  analytic  representation  of  the  distribution 
and  the  line.  The  more  the  distribution  tends  to  cluster  closely  around  a 
line,  the  more  appropriate  is  a  linear  relationship;  and  the  higher  is  the 
correlation  between  the  two  questions.  Figure  32  presents  an  example, 
using  the  two-way  frequency  distribution  from  Table  1  (for  which  a  linear 
relationship  is  not  very  appropriate). 

Although  two-way  frequency  distributions  may  be  summarized  to  present 
some  general  information  regarding  the  interaction  of  the  two  questions,  they 
are  limited  to  describing  only  small  portions  of  the  flow  process  at  once. 


REPRESENTATION  OF  A  TWO-WAY  FREQUENCY  DISTRIBUTION 
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Relationships,  however,  are  not  limited  at  all,  and  may  be  used  to  describe 
either  small  or  large  portions  of  the  flow  process.  In  addition,  relationships 
sufficiently  summarize  the  data  by  an  analytic  representation,  to  bring  its 
general  information  content  into  focus.  They  provide  a  natural  summariza¬ 
tion  of  not  only  two-way,  but  also  higher-order,  frequency  distributions. 

For  a  detailed  analysis  of  the  data,  two-way  frequency  distributions  are 
necessary.  The  analysis  of  relationships  is  required  for  a  general  analysis 
of  the  data.  In  addition,  it  is  useful  for  such  purposes  as  the  planning  and 
analysis  of  additional  investigations,  and  the  program  for  analysis  and  opti¬ 
mization  of  the  process.  Relationships  provide  a  global  view  of  large  portions 
of  the  flow  process,  which  also  enables  many  small  portions  of  the  process  to 
be  examined  simultaneously  and  their  relative  importance  evaluated. 

The  analysis  of  relationships  has  many  advantages  over  the  generation  of 
two-way  frequency  distributions.  One  must,  however,  realize  that  these 
advantages  have  to  be  paid  for  by  the  transformation  of  qualitative  question 
responses  into  numerical  form,  and  the  construction  of  a  process  model  for 
relationships  among  questions.  In  addition,  the  relationship  results  should 
be  analyzed  and  interpreted  by  techniques  which  are  relatively  insensitive 
to  changes  in  the  transformation. 

OBJECTIVES  OF  THE  ANALYSIS 

The  summarization  of  data,  to  bring  into  focus  their  detailed  information 
content  describing  small  portions  of  the  flow  process,  could  be  achieved  by 
means  of  one-way  and  two-way  frequency  distributions  for  single  questions 
and  pairs  of  questions.  An  analysis  of  relationships  among  questions  could 
accomplish  the  additional  summarization  of  data,  to  bring  into  focus  their 
general  information  content  describing  both  small  and  large  portions  of  the 
flow  process. 

Qualitative  question  responses,  however,  pose  a  problem.  Although 
frequency  distributions  may  be  generated  for  qualitative  question  responses, 
they  provide  much  more  definitive  information  for  quantitative  question 


response*.  The  Analysis  of  relationships,  as  noted  above,  requires  both  the 
transformation  of  qualitative  question  responses  into  numerical  form,  and 
the  construction  of  a  process  model  for  relationships  among  questions. 

Thus,  the  objectives  of  the  analysis  are  to: 

•  Generate  one-way  and  two-way  frequency  distributions  for  single 
questions  and  pairs  of  questions. 

•  Transform  qualitative  question  responses  into  numerical  form. 

•  Construct  and  estimate  a  process  model  for  linear  relationships 
among  questions. 

e  Analyse  and  interpret  the  frequency  distribution  and  relationship 
results,  to  provide  meaningful  conclusions  and  recommendations 
which  are  relatively  insensitive  to  changes  in  the  transformation. 

FREQUENCY  DISTRIBUTIONS 

A  one-way  frequency  distribution  has  been  generated  for  59  of  the  63  ques¬ 
tions.  The  remaining  four  questions  were  narrative  and  were  not  categorised. 

From  the  large  number  of  two-way  frequency  distributions  that  could  have 
been  generated,  196  were  selected  for  compilation.  These  were  supple¬ 
mented  by  the  analysis  of  relationships  and  the  complete  correlation  matrix, 
which  was  a  by-product  of  that  analysis. 

One-way  frequency  distributions  were  transcribed  from  the  marginal 
distributions  of  the  appropriate  two-way  frequency  distribution*.  The  com¬ 
puter  program  employed  to  generate  two-way  frequency  distributions  was 
BMD  08D  (see  Reference  3). 

TRANSFORMATION  OF  QUESTION  RESPONSES 

As  noted  above,  the  transformation  of  qualitative  question  responses  into 
numerical  form  is  accomplished  by  the  development  of  a  detailed  structure, 
and  then  the  definition  of  a  numerical  description  for  that  detailed  structure. 


A  detailed  structure  for  question  responses  ii  developed  to  serve  at  the  baala 
for  the  transformation  of  these  responses.  In  addition,  the  detailed  structure 
brings  the  local  aspects  of  the  flow  process  into  focus,  and  provides  a 
foundation  for  a  general  structure.  This  detailed  structure  is  formed  by  the 
informative  arrangement  of  question  responses. 

The  first  step  is  to  specify  the  primary  unifying  characteria'ic  of  each 
question's  responses.  This  response  characteristic  should  b  •  determined 
from  not  only  the  responses  themselves,  but  also  the  question's  intent. 

The  next  step  is  to  collect  into  groups  those  question  responses  which  are 
related  by  the  response  characteristic.  According  to  this  characteristic, 
an  ordering  is  then  arranged  for  groups  and,  to  the  extent  feasible,  for 
responses  within  groups.  All  responses  to  a  question  may  he  arranged  into 
one  ordering,  if  all  responses  within  each  group  may  be  arranged  into  an 
ordering.  According  to  the  response  characteristic,  a  response  or  a  group 
of  responses  is  more  similar  to  responses  or  groups  of  responses  which  are 
closer  to  it  in  the  arrangement,  than  to  those  farther  away. 

Depending  upon  the  implications  of  the  response  characteristic,  there  are 
three  types  of  detailed  structure: 

•  Visible  structure,  explicitly  implied  by  the  response  characteristic. 

•  Partially  visible  structure,  implicitly  implied  by  the  response 
characteristic. 

•  Invisible  structure,  not  implied  at  all  by  the  response  characteristic. 

A  visible  structure  is  obvious,  and  possesses  no  flexibility.  A  partially 
visible  structure  is  apparent,  but  possesses  some  flexibility.  An  invisible 
structure  must  be  inferred,  and  possesses  considerable  flexibility.  The 
position  of  responses  in  the  arrangement  is  meaningful  in  a  visible  structure, 
and  indicative  in  a  partially  visible  structure,  but  only  descriptive  in  an 
invisible  structure. 


Example*  of  visible,  partially  viaible,  and  invisible  structure*  are  given  in 
Table*  2  through  4,  respectively.  For  the  tables,  the  Arabic  numerals  in 
parentheses  indicate  the  ordering  in  the  interview,  while  the  Homan  numer¬ 
als  indicate  the  ordering  in  the  detailed  structure.  The  numerical  descrip¬ 
tion  scale  is  included  in  the  table*. 

Definition  of  a  Numerical  Description 

When  the  detailed  structure  is  developed,  its  numerical  description  is 
appropriate.  By  associating  a  number  with  each  ordered  question  response, 
the  numerical  description  provides  a  more  exact  differentiation  among 
responses,  and  enables  estimation  of  the  process  model  which  is  constructed 
for  linear  relationships  among  questions.  The  numerical  Hesc ription  also 
rupresents  the  data  in  a  form  to  which  a  large  variety  of  numerical 
techniques  may  be  applied. 

According  to  the  response  characteristic,  the  base  point  (zero)  for  a  numeri¬ 
cal  scale  is  selected.  With  each  response,  there  is  associated  a  numerical 
value  corresponding  to  Us  relative  distance  from  the  base  point. 


Table  2 

TRANSFORMATION  OF  QUESTION  RESPONSES:  VISIBLE  STRUCTURE 


Question 

1  3: 

Desired  Acquisition  Time 
Response  Characteristic. 

for  Information 

Days 

Informative  Order 

Scale 

I 

(01) 

F rom  recall 

0.  00 

II 

(02) 

Less  than  1  day 

0.  01 

III 

(03) 

1  to  7  day« 

0.  05 

IV 

(04) 

8  to  30  oays 

0.  20 

V 

(05) 

31  to  90  days 

0.  f>0 

VI 

(06) 

Over  90  days 

1.  oo 

Table  3 

TRANSFORMATION  OF  OUKSTlON  RESPONSES: 
PARTIALLY  VISIBLE  STRUCTURE 

Question 

14: 

First  Source  Contacted  for  Information 

Response  Characteristic:  Distance  from  User 

Informative  Order 

Scale 

I 

(01) 

Received  with  task  assignment 

0.  00 

II 

(04) 

Recalled  it 

0.  05 

III 

(09) 

Searched  own  collection 

0.  10 

IV 

(19) 

Respondent's  own  action 

0.  15 

V 

(03) 

Assigned  subordinate  to  get  it 

0.  20 

VI 

(05) 

Asked  a  colleague 

0.  25 

VII 

(02) 

Asked  my  supervisor 

0.  30 

VIII 

(08) 

Requested  search  of  departmental  files 

0.  35 

IX 

(06) 

Asked  an  ii  tcrnal  consultant 

0.  45 

X 

(10) 

Sea rched  o rgani nation's  technical  information  1 
center. 

*  0.  50 

X 

(07) 

Requested  technical  information  center 
search.  ] 

XI 

(15) 

Requested  data  from  vendor,  manufacturer, 
or  supplier- 

0.  60 

XI 

(14) 

Searched  vendor,  manufacturer,  or  supplier  I 
sources  ] 

XII 

(11) 

Searched  outside  technical  information  center 

0.  70 

XIII 

(18) 

Asked  an  external  consultant  or  expert 

0.  HO 

XIV 

(13) 

Requested  search  of  Government  Information 
Cente  r 

*  0,  90 

XIV 

(12) 

Searched  Government  Information  Center 

XV 

117) 

Asked  customer 

1 .  00 

vNo  distinction  is  made  between  the  two  responses  in  this  group  of  related 
responses-. 


Table  4 

TRANSFORMATION  OF  QUESTION  RESPONSES:  INVISIBLE  STRUCTURE 


Question  27:  Desired  Layout  of  Information  Media 
Response  Characteristic:  Formality 


Informative  Order 

Scale 

I 

(14) 

Recall 

0.  00 

11 

(13) 

Telephone  conversation 

0.  06 

III 

(11) 

Group  discussion 

0.  12 

-IV 

(04) 

Photographs 

0.  19 

V 

(03) 

Graphics  (diagrams,  drawings,  schematics, 
flow  charts,  graphs,  maps) 

0.  25 

VI 

(02) 

Tables  or  lists 

0.  31 

VII 

(01) 

Narrative  text 

0.  37 

VIII 

(18) 

Narrative  text,  and  tables  or  lists 

0.  44 

IX 

(09) 

G  raphics  and  lists 

0.  50 

X 

(08) 

Photographs  and  text 

0.  56 

XI 

(07) 

Graphics  and  text 

0.  63 

XII 

(16) 

Graphics,  text,  and  oral 

0.  69 

XIII 

(17) 

Graphics,  text,  oral,  and  recall 

0.  75 

XIV 

(12) 

Informal  briefing,  with  chalk  or  pencil 
drawings 

0.  82 

XV 

(05) 

Microfilm  or  microfiche 

0.  88 

XVI 

(06) 

Slides  or  motion  pictures 

0.  94 

XVII 

(10) 

formal  briefing  or  lecture 

1.  00 

Except  for  two  questions,  1,  0,  or  a  positive  integer  is  associated  with 
each  question  response.  The  two  exceptional  questions  have  multiples  o! 
one-half  associated  with  some  responses  for  convenience.  When  it  is  mean¬ 
ingful  to  consider  the  response  to  be  null,  0  is  used;  and  when  it  is  meaning¬ 
ful  to  consider  the  response  as  opposite  in  direction  to  the  remaining 
rctponscs,  -J  is  used.  Variable  spacing  between  the  associated  numbers 
indicates  that  the  responses  exhibit  variable  similarity,  or  distance  irom 


each  other,  according  to  ti  e  response  characteristic.  The  game  number  iB 
associated  with  two  responses  to  a  question  if--and  only  if- -the  two 
responses  are  in  the  same  group  of  related  responses,  and  the  responses 
within  that  group  are  not  arranged  into  an  ordering  (that  is,  are  considered  to 
be  the  saute  distance  from  the  base  point). 

The  association  of  a  number  with  each  question  response  associates  a  scale 
of  possible  numerical  values  with  the  question.  Then  all  numerical  values 
in  the  6cale  are  divided  by  the  largest  one,  so  that  the  scale  is  normalized  to 
between  -1  and  l--and  usually  between  0  and  1. 

The  value  of  the  numerical  description  is  meaningful  for  responses  in  a 
visible  structure,  and  indicative  for  responses  in  a  partially  visible  struc¬ 
ture,  but  only  descriptive  for  re  spe  'its  in  an  invisible  structure.  Examples 
are  again  provided  by  Tables  2  thro  gh  4. 

A  detailed  structure  suggests  its  own  numerical  description  when  the  question 
responses  have  been  properly  arranged.  For  a  more  refined  relationship 
analysis,  a  numerical  description  could  be  altered  to  improve  the  linearity 
of  important  relationships  wh  -h  involve  the  corresponding  question. 

CONSTRUCTION  OF  A  PROCESS  MODEL 

Development  of  a  general  structure,  and  specification  of  the  general  form 
for  meaningful  linear  relationships  among  questions  in  the  general  structure, 
accomplish  the  construction  of  a  process  model. 

Development  of  a  General  Structure 

A  general  structure  now  is  developed  to  serve  as  the  basis  for  the  construc¬ 
tion  of  a  process  model  for  linear  relationships  among  questions,  and  to 
bring  the  global  aspects  of  t lie  flow  process  into  focus.  This  general  struc¬ 
ture  is  formed  by  the  informative  arrangement  of  questions. 

The  first  slop  is  to  identify  the  components  of  the  flow  process  as  USER, 
TASK,  UTILIZATION,  and  SEARCH  AND  ACQUISITION.  The  next  step  is  to 


form  group*  of  related  questions  within  component*.  Then  an  ordering  is 
arranged  for  components,  group*  within  component*,  and  questions  within 
groups.  To  the  extent  feasible,  the  arrangement  should  possess  the  desirable 
characteristic  that  a  question  tend*  to  influence  only  those  question*  which 
follow  it. 

It  is  frequently  both  convenient  and  sufficiently  accurate  - -for  example,  during 
exploratory  research  such  as  thi*  investigation- -to  combine  groups  of  related 
questions.  The  combination  of  related  questions  summarizes  the  general 
structure,  and  simplifies  the  specification  and  estimation  of  meaningful 
linear  relationships  among  questions. 

Two  of  the  simplest  types  of  combinations  are  averages  and  products.  They 
keep  the  combination  scales  normalized  to  between  -1  and  1,  Except  for  the 
four  cases  in  which  a  product  of  two  questions  is  employed,  all  of  the  combi¬ 
nations  are  averages  of  two  questions. 

A  special  USER-TASK  flexibility  index  F  summarizes  the  flexibility  exhibited 
by  the  difference  between  the  user's  position  within  the  researrh-devclopmert- 
production  cycle  and  that  of  his  task;  and  the  difference  between  the  user's 
field  of  position,  and  that  of  his  task.  To  summarize  the  effort  expended  by 
the  user  in  his  utilization  of  the  information  system  and  the  problems 
encountered  by  him  in  this  utilization,  the  respective  special  indexes,  E  for 
UTILIZATION  effort  and  P  for  UTILIZATION  problems,  are  introduced. 

The  inadequacy  of  the  search  and  acquisition  process,  for  information  used 
ir,  task  performance,  is  summarized  by  the  special  index  I  for  SEARCH  AND 
ACQUISITION  inadequacy.  The  scales  for  F.  E,  P,  and  I  are  also  normalized 
to  between  -1  and  1. 

An  example  is  provided  by  Table  S,  which  also  includes  linear  relationships. 

In  this  table .  Q  denotes  Question,  and  P0,  pj.  P;> . R&  symbolize  general 

unspecified  constanto  in  the  relationships.  For  simplicity,  the  same  symbols, 
Rq,  pj,  •  •  •  ,  Pf,.  are  uoed  in  each  relationship;  although  they  are  not 

meant  to  denote  the  same  constants. 


Table  5 

CONSTRUCTION  OF  A  PROCESS  MODEL:  USER  COMPONENT* 


1 .  User's  age :  Q48 

2.  User's  education 

A.  User's  highest  degree:  Q50A  =  fig  +  (3|  (Q48) 

B.  User's  field  of  degree:  Q50C  =  Po  +  Pi  (Q48) 

3.  User's  experience 

Combination:  1 IZ  (Q51  +  Q52)  =  0q  +  Pi  (Q48) 

A.  User's  job  experience:  Q51 

B.  User's  company  experience:  Q52 

4.  User's  position 

A.  User's  position  within  the  research-development-production  cycle 

Q55  =  p0  +  flj  (Q48)  +  p2  (QSOA)  +  p3  (O50C) 

+  (3 4  (1/2  (Q 51  +  Q 52) ) 

B.  User's  field  of  position 

QS6  =00  +  0!  (Q48)  +  p2  (Q50A)  +  03  (Q50C) 

+  04  (1/2  (Q51  +  Q52)) 

5.  User's  level 
Combination: 

1/2  (Q49  +  Q58)  -  0O  +  0j  (Q48)  +  pz  (Q50A)  +  03  (Q50C) 

+  P4  (1/2  (Q5 1  +  Q52);  +  P5(Q55)  +  06  (Q56) 

A.  User' 8  salary  level:  Q58 

B.  Number  of  personnel  managed  by  user:  Q49 

*Q  denotes  Question:  and  p(),  0;,  02,  .  ,  p£  symbolize  general  unspecified 
constants  in  the  relationships.  For  simplicity,  the  same  symbols,  Po.  Pi. 
02<  •••>  06>  are  used  for  each  relationship:  Ithough  they  are  not  meant 
to  denote  the  same  constants. 


A  question  combination  (component)  which  tends  to  influence  other  combina¬ 
tions  of  questions  (components)  is  called  an  input  factor  (component),  and  a 
combination  of  questions  (component)  which  tends  to  be  influenced  by  ether 
question  combinations  (components)  is  called  an  output  factor  (component). 


The  terms,  combination  of  questions  ana  question  combination,  also  are  used 
to  cover  the  degenerate  case  of  a  single  question- -for  example,  Q56  in 
Table  5.  Arrangement  of  components  and  question  combinations  within 
components,  according  to  an  input/nutput  point  of  view,  facilitates  the 
specification  of  the  general  form  for  meaningful  linear  relationships  among 
combinations  of  questions.  It  also  provide*  insight  into  the  flow  process. 

TWhen  a  more  refined  relationship  analysis  is  desired,  the  question  combina¬ 
tions  could  be  separated,  and  more  special  summarizing  indexes  could 
perhaps  be  defined. 

Sfiecifi  cation  of  the  General  Form  for  Relationships 

Once  the  general  structure  is  developed  and  groups  of  related  questions  are 
combined,  it  is  appropriate  to  specify  the  general  form  for  meaningful  linear 
relationships  among  combinations  of  questions  in  the  general  structure. 

Analysis  of  the  general  structure,  from  an  input/output  point  of  view,  yields 
those  question  combinations  wnich  are  judged  to  be  potentially  related  to 
each  combination  of  questions  in  the  general  structure.  Only  the  potentially 
related  question  combinations  are  included  in  the  general  form  of  the  linear 
relationship,  for  that  combination  of  questions.  An  example  is  provided  by 
Table  5. 

When  the  questions  have  been  properly  arranged  and  summarized  by  combina¬ 
tion,  a  general  structure  suggests  the  general  form  for  meaningful  relation¬ 
ships.  A  more  refined  relationship  analysis  could  specify  the  general  form 
for  additional  relationships,  particularly  those  necessitated  by  the  separation 
of  question  combinations. 


ESTIMATION  OF  RELATIONSHIP  , 

The  unspecified  constants,  in  the  general  form  of  meaningful  linear  relation¬ 
ships  among  combinations  of  questions,  are  estimated  from  the  numerically 
transformed  question  responses  by  the  statistical  technique  called  stepwise 


regression  analysis.  Reference  3  presents  a  complete  description  of  tiiis 
technique.  A  brief  discussion  of  only  the  pertinent  aspects  of  stepwise 
regression  analysis  follows. 

Stepwise  regression  analysis  estimates  the  relationship  in  steps,  by  entering 
one  question  combination  at  a  time.  At  eao  step,  the  question  combination 
which  is  entered  is  the  one  that  adds  the  greatest  contribution  to  the  relation¬ 
ship  from  the  previous  step.  A  measure  of  this  contribution  is  the  F  to 
enter  of  that  question  combination.  The  contribution  of  a  question  combina¬ 
tion  to  the  relationship  at  each  step  is  measured  by  its  F  to  remove  at  that 
step,  and  the  significance  of  the  relationship  at  each  step  is  measured  by  the 
multiple  correlation  (coefficient)  at  that  step.  Relative  significance  within  a 
relationship  is  indicated  by  the  former,  while  relative  significance  among 
relationships  is  indicated  by  the  latter.  In  addition,  the  potential  contribu¬ 
tion  to  the  relationship  at  each  step,  of  some  question  combinations  which 
were  not  included  in  the  general  form  of  the  relationship,  is  measured  by 
their  potential  F  to  enter  at  tiiat  step. 

The  computer  program  employed  for  the  stepwise  regression  analysis  is 
BMD  02R  (Reference  3). 

TRANSFORMATION  OF  RELATIONSHIP  RESULTS 

The  stepwise  regression  computer  printouts  conta.n  a  wealth  of  numerical 
detail,  concerning  relationship  results  and  their  significance.  To  make  the 
conclusions  of  the  relationship  analysis  relatively  insensitive  lo  the  trans¬ 
formation  of  qualitative  question  responses  into  numerical  form,  the  numeri¬ 
cal  relationship  results  must  be  transformed  back  to  qualitative  form.  In 
addition,  summarization  of  the  numerical  detail  is  quite  informative. 

Both  of  these  requirements  are  accomplished  by  a  ranking  procedure  which: 

•  Ranks  question  combinations  in  the  order  of  their  contribution  to 
each  relationship. 


•  Then  rank*  question  combination*  in  the  order  of  their  overall 
contribution  to  the  relationahip*  in  each  component  of  the  flow 
process,  and  the  flow  process  itself. 

The  former  focuses  upon  a  given  combination  of  questions,  and  observes 
which  question  combinations  are  most  significantly  related  to  it;  while  the 
latter  focuses  upon  the  appropriate  collection  of  combinations  of  questions, 
and  observes  which  question  combinations  are  moat  significantly  related  to 
most  of  them. 

Ranking  of  Contributions  to  Relationships 

An  effective  step  in  the  stepwise  regression  analysis,  beyond  which  relatively 
little  is  contributed  to  the  relationship,  is  determined  when  the  F  to  enter  of 
-the  question  combination  entering  at  that  step  becomes  less  than  some  lower 
bound.  Analysis  of  the  stepwise  regression  computer  printouts  indicates  that 
a  reasonable  value  for  this  lower  bound  is  6.66  (F  level  of  0.  01).  When  a 
question  combination  is  included  in  the  relationship  at  the  effective  step,  it 
is  said  to  be  related  to  the  given  combination  of  questions.  Those  question 
combinations,  whose  potential  F  to  enter  at  the  effective  step  is  at  or  above 
6.66,  are  said  to  be  candidates  for  the  relationship. 

For  each  combination  of  questions  in  the  general  structure,  the  question 
combinations  which  arc  related  to  it  are  ranked  in  the  order  of  their  contri¬ 
bution  to  the  relationship.  Table  6  contains  an  example. 

Ranking  of  Contributions  to  Components  and  Flow  Process 

These  rankings  of  contributions  may  be  obtained  by  properly  combining  the 
rankings  of  contributions  for  the  appropriate  collection  of  relationships.  To 
accomplish  this,  numerical  values  must  be  assigned  to  the  relationship 
rankings.  This  return  to  numerical  form  is,  however,  an  artifice  and  only 
temporary. 

The  procedure  assigns  a  value  to  a  relationship  ranking  as  follows:  0  to  the 
given  combination  of  questions,  1  to  the  question  combination  making  the 
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largest  contribution  to  the  relationship,  2  to  the  question  combination  making 
the  second  largest  contribution  to  the  relationship,  ....  m  to  the  question 
combination  making  the  smallest  contribution  to  the  relationship;  m  +  1  to  the 
candidate  for  the  relationship  potentially  making  the  largest  contribution  to 
the  relationship,  m  +  2  to  the  candidate  for  the  relationship  potentially  making 
the  second  largest  contribution  to  the  relationship,  ....  p  <  11  to  the  candi¬ 
date  for  the  relationship  potentially  making  the  smallest  contribution  to  the 
relationship;  and  12  to  those  question  combinations  which  do  not  appear, 
although  they  might  have  appeared  according  to  the  general  structure  and  the 
input/output  view  of  the  flow  process.  This  value  was  selected  because  no 
combination  of  questions  had  more  than  11  question  combinations,  which 
were  either  related  to  it  or  were  candidates  for  the  relationship. 

Now  the  sum  of  these  numerical  values  is  computed  for  a  question  combina¬ 
tion  over  each  component,  and  over  their  aggregate  for  the  flow  process. 

Then  the  sums  for  each  component  and  thoae  for  the  flow  process  are  ranked 
among  themselves,  in  order  of  increasing  size.  Only  a  few  ambiguities  were 
present  in  computing  these  rankings  and  their  sums.  They  involved  questions 
which  occurred  in  relationships,  both  alone  and  in  question  combinations. 
These  questions  were  always  associated  with  the  appropriate  question 
combination  which  contained  them.  Table  7  contains  an  example. 

For  a  more  refined  ranking,  the  significance  of  the  actual  or  potential 
contribution  to  a  relationship  and  the  significance  of  a  relationship  could  be 
employed  to  compute  weights  for  use  in  calculating  a  weighted  turn,  upon 
which  to  base  the  ranking.  A  question  combination  appears  to  make  a 
significant  contribution  to  the  relationship,  when  its  F  to  remove  at  the 
effective  step  is  between  30  and  90  (30  s  F  to  remove  <  90);  and  appears  to 
make  a  highly  significant  contribution  to  the  relationship,  when  its  F  to 
lemove  at  that  step  is  at  or  about  90  (F  to  remove  i  90).  If  the  multiple 
correlation  at  the  effective  step  is  at  or  above  0.  40  in  absolute  value,  then 
the  relationship  seems  to  be  significant.  Those  question  combinations, 
whose  potential  F  to  enter  at  the  effective  step  is  at  or  above  30,  appear  to 
potentially  make  a  significant  contribution  to  the  relationship. 
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32 

8 
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•>0 
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Question  combination  rank 

2 

1 

3 

4 

5-1/2 
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’■‘Table  entries  are  assigned,  according  to  order  of  appearance  in  Table  6,  as 
follows:  0  to  combination  of  questions  in  "Characteristic"  column:  1  to  first 
question  combination,  2  to  second  question  combination,  ...  ni  to  last 
question  combination  in  "Related  To"  column;  m  t  1  to  first  question  combi¬ 
nation,  m  4  2  to  second  question  combination,  ....  p  s  1  1  to  last  question 
combination  in  "Candidate  for  Relationship"  column;  and  12,  which  is 
omitted  for  simplicity,  to  those  question  combinations  not  appearing. 


gement  and 


It  is  both  informative  and  suggestive  to  characterise  combinations  of  ques¬ 
tions  aa  input  factora  and  output  factora,  in  designing  and  analyzing  the  flow 
process  (set  Figures  Z5  through  30).  One  must  realize,  however,  that 
stepwise  regression  analysis  can  merely  estimate  and  indicate  the  »:jnifi- 
cance  of  a  relationship.  It  cannot  imply  that  the  relationship  ia  cause-and- 
effect,  for  this  can  only  be  determined  by  a  thorough  understanding  of  the 
flow  process.  Therefore  the  terms,  input  factor  and  output  factor,  are  used 
in  full  recognition  of  the  attendant  advantages  and  disadvantages. 
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Appendix 

PA R TIC  1 P ATI NC i  ORGAN) / A T1 ONS 

Table  A-l  lists  the  organizations  whose  personnel  were  inte 

investigation. 

Table  A-l  (Paj»e  1  of  3) 

ORGANIZATIONS 

rviewed  lor  the 

Numb'  r  of 

Popu  lat  ion  01 

Pc  r  sons 

Qualified 

Organization 

Into  r  viewed  He  ''sonnel 

Aerospace  Corporation 

Z  5 

1  ,  800 

Allegheny  Ludlum  Stool  Corporation 

1 

HI) 

Allis  -  Cha  Into  rs  Manufacturing  Company 

2 

1  *5 

American  Machine  &  Foundry  Company 

1 

100 

Ampex  Co  rpo  ration 

in 

760 

Arthur  0.  Little,  Inc. 

7 

MlO 

Armstrong  Cork  Company 

4 

210 

AYCO  Corporation,  Research  and 

Development  Division 

< ! 

3,  500 

The  Babcock  Wilcox  Company 

3 

250 

Battelle  Memorial  Institute 

1 1 

775 

Bechtel  Corporation 

i 

7  0 

Hooch  Aircraft  Corporation 

6 

470 

Bell  Aerosystems  Company 

1 1 

i ,  ono 

Bell  6  Howell  Research  Center 

3 

5<J0 

The  Bendix  Corporation 

6 

SiiO 

Bissett  -  Bo  rman  Co  rpo  rat  ion 

i 

o  5 

The  Hoeing  Coir.pany 

64 

b,  bOO 

Cult  Indus!  ru's,  ! it c . 

s 

72  5 

Cornell  Aeronautical  Laboratory,  Inc. 

6 

450 

Corning  Glass  Works 

3 

450 

Do  Laval  Turbine,  Inc. 

2 

160 

Douelas  Aircraft  Company,  Ini. 

66 

,  6  -4  5 

Dupunt  Company,  lnt. 

45 

3,  200 

Klee  trie  Storage  Battery  Company 

1 

200 

Kmerson  Klee  trie  Company  of  St.  Louis 

5 

32  5 

I\ii  ri  hild-II  ille r  Corporation.  Republic 

A  vi a t  i  1 1 1 1  division 

1 

: ‘1  he  person  Iron*.  Republic  Aviation  had  just 

joined  the  com 

pany  at  which*  he 

was  interviewed.  Ilis  answers  to  questions 

reflect  his  position,  task,  and 

so  forth,  at  Republic  Aviation. 

57 


Table  A  -  1  (Page  2  oi  3) 


Organisation 

Number  of 
Pe  r suns 
Into  rvie  wed 

Population  of 
Qualified 
Personnel 

OCA  Corporation,  Technology  Division 

3 

1  45 

General  Dynamics  Corporation 

129 

1  3.  155 

General  Precision,  Inc.  ,  Link  Group 

8 

315 

Goodway  Printing  Company,  Inc. 

3 

200 

Hamilton  Watch  Company 

1 

1  10 

Hazeltine  Corporation 

1  0 

800 

Hercules  Powder  Company 

23 

1,  350 

Honeywell,  Inc.  ,  Aeronautical  Division 

12 

910 

HRB  Singe  r,  Inc. 

6 

3  8  5 

IBM,  Federal  Systems  Divirion 

3-1 

3,  780 

Inge  r  soil  -  Rand  Company 

1 

55 

Institute  for  Defense  Analysis 

1  5 

400 

Institute  ol  Science  k  Technology 

International  Harvester  Company, 

4 

475 

Solar  Division 

4 

250 

International  Resistance  Company 

Johns  Hopkins  University,  Applied  Physics 

1 

65 

Laboratory 

14 

860 

Kollaman  Instrument  Corporation 

4 

250 

Lear  Siegler,  Inc.  ,  Power  Equipment  Division 

9 

255 

Leesona  Moos  Laboratories 

1 

1  00 

Linp-Tcmco - Vought,  Inc. 

63 

3,  500 

Loral  Electronics  Systems 

4 

350 

Lord  Corporation 

2 

125 

Lundy  Electronics  k-  Systems,  Inc. 

1 

60 

Management  Systems  Corporation 

1 

20 

Massachusetts  Institute  ol  Technology 

32 

2,  000 

Monsanto  Company 

44 

3,  500 

Martin  Company 

100 

7,  000 

McDonnell  Aircraft  Corporation 

27 

1,  900 

Melpar,  Inc. 

8 

900 

Menasco  Manufacturing  Company 

North  American  Aviation,  Inc.  .  Columbus 

1 

65 

Division 

North  American  Aviation.  Inc.  ,  Divisions  in  the 

21 

1  ,  570 

Los  Angeles  Metropolitan  Area 

269 

18, 500 

Northrop  Corporation 

29 

1  .  730 

Olin  Researcli  Center 

4 

300 

Otis  Elevatoi  Company 

1 

50 

Philco  Corporation 

26 

5.  000 

Pittsburgh!  Plate  Glass  Company 

3 

225 

The  Rand  Corporation 

1 1 

750 

Raytheon  Company 

49 

4,  000 

